Facial Landmark, Head Pose, and Occlusion Analysis using Multitask Stacked Hourglass

نویسندگان

چکیده

In this study, we proposed a multitask network architecture for three attributes, landmark, head pose, and occlusion, from face image. A 2-stacked hourglass with task-specific heads is the architecture. We also designed auxiliary components network. First feature pyramid fusion module, which plays crucial role in facilitating contextual information various receptive fields. Second interlevel occlusion-aware explicitly fuses intermediate occlusion prediction between subnetworks. The third gimbal-lock-free pose head, outputs rotation matrix 6D representation. conducted an ablative study of these to determine their impacts on Additionally, introduced landmark heatmap scaling approach avoid falling local minima. trained 300W-LP dataset C-CM occlusion. Then, fine-tuned using 300W or WFLW dataset, instead task. This 2-stage training method contributes enhancing detection accuracy that other tasks. experiments, assessed eight test datasets metrics.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Stacked Hourglass Networks for Human Pose Estimation

This work introduces a novel Convolutional Network architecture for the task of human pose estimation. Features are processed across all scales and consolidated to best capture the various spatial relationships associated with the body. We show how repeated bottom-up, top-down processing used in conjunction with intermediate supervision is critical to improving the performance of the network. W...

متن کامل

Facial expression analysis robust to 3D head pose motion

Most face expression algorithms assume a front or 'near-to-front' head position. This assumption becomes an important limitation when studying input from real systems. In this article we present a new approach to robustly determine face expression independently of the head pose. Our analysis-synthesis cooperation, possible thanks to the use of a highly realistic 3D head model and the applicatio...

متن کامل

Hough Networks for Head Pose Estimation and Facial Feature Localization

We present Hough Networks (HNs), a novel method that combines the idea of Hough Forests (HFs) [12] with Convolutional Neural Networks (CNNs) [18]. Similar to HFs we perform a simultaneous classification and regression on densely extracted image patches. But instead of a Random Forest (RF) we utilize a CNN which is able to learn higherorder feature representations and does not rely on any handcr...

متن کامل

3D Face Recognition Using Geodesic Facial Curves to Handle Expression, Occlusion and Pose Variations

this paper illustrates the use of radial facial curves on 3D meshes to mode facial deformation caused by expression, occlusion and variation in poses and to recognize faces despite large expression, in presence of occlusion and pose variations. Here we represent facial surface by indexed collection of radial geodesic curves on 3D face meshes emanating from nose tip to the boundary of mesh and c...

متن کامل

Facial Feature Based Head Tracking and Pose Estimation

Main purpose of this thesis is to study a facial feature based human head pose estimation. Especially the facial feature extraction methods suitable for a real-time system are considered. Based on the literature survey, an experimental head tracking and pose estimation system is implemented. An efficient method based on skin detection, gray-scale morphology and a geometrical face model are appl...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2023

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2023.3262247